Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 41533 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 7.0 MiB |
| Average record size in memory | 176.0 B |
Variable types
| NUM | 11 |
|---|---|
| BOOL | 6 |
| CAT | 5 |
city_name has a high cardinality: 1047 distinct values | High cardinality |
region is highly correlated with province | High correlation |
province is highly correlated with region | High correlation |
house_area is highly skewed (γ1 = 121.9365874) | Skewed |
garden_area is highly skewed (γ1 = 108.5190241) | Skewed |
surface_of_the_land is highly skewed (γ1 = 53.81642955) | Skewed |
Unnamed: 0 has unique values | Unique |
terrace_area has 25193 (60.7%) zeros | Zeros |
garden_area has 33596 (80.9%) zeros | Zeros |
surface_of_the_land has 21205 (51.1%) zeros | Zeros |
number_of_facades has 10404 (25.0%) zeros | Zeros |
Reproduction
| Analysis started | 2020-09-17 22:09:15.588396 |
|---|---|
| Analysis finished | 2020-09-17 22:09:45.401536 |
| Duration | 29.81 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 41533 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25731.71478 |
|---|---|
| Minimum | 0 |
| Maximum | 52075 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 324.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2537.6 |
| Q1 | 12558 |
| median | 25529 |
| Q3 | 38774 |
| 95-th percentile | 49447.4 |
| Maximum | 52075 |
| Range | 52075 |
| Interquartile range (IQR) | 26216 |
Descriptive statistics
| Standard deviation | 15100.07888 |
|---|---|
| Coefficient of variation (CV) | 0.5868275399 |
| Kurtosis | -1.209303272 |
| Mean | 25731.71478 |
| Median Absolute Deviation (MAD) | 13126 |
| Skewness | 0.02404907666 |
| Sum | 1068715310 |
| Variance | 228012382.2 |
| Monotocity | Strictly increasing |
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 50610 | 1 | < 0.1% | |
| 15821 | 1 | < 0.1% | |
| 13772 | 1 | < 0.1% | |
| 3531 | 1 | < 0.1% | |
| 1482 | 1 | < 0.1% | |
| 7625 | 1 | < 0.1% | |
| 5576 | 1 | < 0.1% | |
| 26054 | 1 | < 0.1% | |
| 30148 | 1 | < 0.1% | |
| Other values (41523) | 41523 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 52075 | 1 | < 0.1% | |
| 52073 | 1 | < 0.1% | |
| 52072 | 1 | < 0.1% | |
| 52071 | 1 | < 0.1% | |
| 52070 | 1 | < 0.1% |
postal_code
Real number (ℝ≥0)
| Distinct | 1058 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5173.781475 |
|---|---|
| Minimum | 1000 |
| Maximum | 9992 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 324.5 KiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 1080 |
| Q1 | 2322 |
| median | 4620 |
| Q3 | 8380 |
| 95-th percentile | 9402 |
| Maximum | 9992 |
| Range | 8992 |
| Interquartile range (IQR) | 6058 |
Descriptive statistics
| Standard deviation | 2977.364009 |
|---|---|
| Coefficient of variation (CV) | 0.5754715431 |
| Kurtosis | -1.511348912 |
| Mean | 5173.781475 |
| Median Absolute Deviation (MAD) | 2835 |
| Skewness | 0.09705636893 |
| Sum | 214882666 |
| Variance | 8864696.443 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 8300 | 757 | 1.8% | |
| 8400 | 705 | 1.7% | |
| 9000 | 652 | 1.6% | |
| 1180 | 519 | 1.2% | |
| 1000 | 473 | 1.1% | |
| 8370 | 452 | 1.1% | |
| 4000 | 433 | 1.0% | |
| 8670 | 367 | 0.9% | |
| 1050 | 345 | 0.8% | |
| 2000 | 336 | 0.8% | |
| Other values (1048) | 36494 | 87.9% |
| Value | Count | Frequency (%) | |
| 1000 | 473 | 1.1% | |
| 1020 | 139 | 0.3% | |
| 1030 | 333 | 0.8% | |
| 1040 | 153 | 0.4% | |
| 1050 | 345 | 0.8% |
| Value | Count | Frequency (%) | |
| 9992 | 5 | < 0.1% | |
| 9991 | 14 | < 0.1% | |
| 9990 | 49 | 0.1% | |
| 9988 | 10 | < 0.1% | |
| 9982 | 2 | < 0.1% |
| Distinct | 1047 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 324.5 KiB |
| Antwerpen | 929 |
|---|---|
| Knokke | 757 |
| Oostende | 705 |
| Gent | 652 |
| Uccle | 519 |
| Other values (1042) |
| Value | Count | Frequency (%) | |
| Antwerpen | 929 | 2.2% | |
| Knokke | 757 | 1.8% | |
| Oostende | 705 | 1.7% | |
| Gent | 652 | 1.6% | |
| Uccle | 519 | 1.2% | |
| Bruxelles | 473 | 1.1% | |
| Uitkerke | 452 | 1.1% | |
| Glain | 433 | 1.0% | |
| Wulpen | 367 | 0.9% | |
| Ixelles | 345 | 0.8% | |
| Other values (1037) | 35901 | 86.4% |
Unique
| Unique | 84 ? |
|---|---|
| Unique (%) | 0.2% |
Length
| Max length | 30 |
|---|---|
| Median length | 8 |
| Mean length | 8.570413888 |
| Min length | 2 |
type_of_property
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 324.5 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 22294 | 53.7% | |
| 1 | 19239 | 46.3% |
subtype_of_property
Categorical
| Distinct | 22 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 324.5 KiB |
| house | |
|---|---|
| apartment | |
| villa | |
| duplex | 1274 |
| ground floor | 1093 |
| Other values (17) |
| Value | Count | Frequency (%) | |
| house | 16596 | 40.0% | |
| apartment | 15251 | 36.7% | |
| villa | 2374 | 5.7% | |
| duplex | 1274 | 3.1% | |
| ground floor | 1093 | 2.6% | |
| penthouse | 823 | 2.0% | |
| apartment block | 724 | 1.7% | |
| mixed use building | 679 | 1.6% | |
| mansion | 393 | 0.9% | |
| exceptional property | 381 | 0.9% | |
| Other values (12) | 1945 | 4.7% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 20 |
|---|---|
| Median length | 7 |
| Mean length | 7.527147088 |
| Min length | 3 |
price
Real number (ℝ≥0)
| Distinct | 3535 |
|---|---|
| Distinct (%) | 8.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 315395.0649 |
|---|---|
| Minimum | 2500 |
| Maximum | 950000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 324.5 KiB |
Quantile statistics
| Minimum | 2500 |
|---|---|
| 5-th percentile | 120000 |
| Q1 | 199000 |
| median | 275000 |
| Q3 | 380000 |
| 95-th percentile | 689000 |
| Maximum | 950000 |
| Range | 947500 |
| Interquartile range (IQR) | 181000 |
Descriptive statistics
| Standard deviation | 169480.0016 |
|---|---|
| Coefficient of variation (CV) | 0.5373578107 |
| Kurtosis | 1.884328684 |
| Mean | 315395.0649 |
| Median Absolute Deviation (MAD) | 85000 |
| Skewness | 1.362081539 |
| Sum | 1.309930323e+10 |
| Variance | 2.872347093e+10 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 249000 | 570 | 1.4% | |
| 199000 | 562 | 1.4% | |
| 299000 | 560 | 1.3% | |
| 295000 | 540 | 1.3% | |
| 225000 | 534 | 1.3% | |
| 275000 | 527 | 1.3% | |
| 325000 | 439 | 1.1% | |
| 235000 | 428 | 1.0% | |
| 175000 | 428 | 1.0% | |
| 395000 | 424 | 1.0% | |
| Other values (3525) | 36521 | 87.9% |
| Value | Count | Frequency (%) | |
| 2500 | 3 | < 0.1% | |
| 6600 | 1 | < 0.1% | |
| 8160 | 1 | < 0.1% | |
| 9999 | 1 | < 0.1% | |
| 10000 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 950000 | 76 | 0.2% | |
| 949000 | 9 | < 0.1% | |
| 948000 | 2 | < 0.1% | |
| 947000 | 3 | < 0.1% | |
| 945000 | 36 | 0.1% |
number_of_rooms
Real number (ℝ≥0)
| Distinct | 22 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.848168926 |
|---|---|
| Minimum | 1 |
| Maximum | 30 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 324.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 30 |
| Range | 29 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.360592664 |
|---|---|
| Coefficient of variation (CV) | 0.4777078534 |
| Kurtosis | 21.51819673 |
| Mean | 2.848168926 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.504196405 |
| Sum | 118293 |
| Variance | 1.851212396 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 2 | 14062 | 33.9% | |
| 3 | 13563 | 32.7% | |
| 4 | 5935 | 14.3% | |
| 1 | 4314 | 10.4% | |
| 5 | 2148 | 5.2% | |
| 6 | 888 | 2.1% | |
| 7 | 278 | 0.7% | |
| 8 | 144 | 0.3% | |
| 9 | 62 | 0.1% | |
| 10 | 57 | 0.1% | |
| Other values (12) | 82 | 0.2% |
| Value | Count | Frequency (%) | |
| 1 | 4314 | 10.4% | |
| 2 | 14062 | 33.9% | |
| 3 | 13563 | 32.7% | |
| 4 | 5935 | 14.3% | |
| 5 | 2148 | 5.2% |
| Value | Count | Frequency (%) | |
| 30 | 2 | < 0.1% | |
| 24 | 2 | < 0.1% | |
| 23 | 1 | < 0.1% | |
| 22 | 1 | < 0.1% | |
| 20 | 2 | < 0.1% |
| Distinct | 677 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 155.1154022 |
|---|---|
| Minimum | 1 |
| Maximum | 31700 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 324.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 60 |
| Q1 | 92 |
| median | 130 |
| Q3 | 187 |
| 95-th percentile | 330 |
| Maximum | 31700 |
| Range | 31699 |
| Interquartile range (IQR) | 95 |
Descriptive statistics
| Standard deviation | 184.0280121 |
|---|---|
| Coefficient of variation (CV) | 1.186394191 |
| Kurtosis | 20792.6004 |
| Mean | 155.1154022 |
| Median Absolute Deviation (MAD) | 43 |
| Skewness | 121.9365874 |
| Sum | 6442408 |
| Variance | 33866.30925 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 90 | 906 | 2.2% | |
| 120 | 897 | 2.2% | |
| 100 | 888 | 2.1% | |
| 150 | 835 | 2.0% | |
| 140 | 768 | 1.8% | |
| 80 | 750 | 1.8% | |
| 110 | 713 | 1.7% | |
| 200 | 711 | 1.7% | |
| 160 | 695 | 1.7% | |
| 130 | 672 | 1.6% | |
| Other values (667) | 33698 | 81.1% |
| Value | Count | Frequency (%) | |
| 1 | 4 | < 0.1% | |
| 5 | 3 | < 0.1% | |
| 11 | 1 | < 0.1% | |
| 13 | 2 | < 0.1% | |
| 14 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 31700 | 1 | < 0.1% | |
| 3560 | 1 | < 0.1% | |
| 2400 | 1 | < 0.1% | |
| 2019 | 1 | < 0.1% | |
| 1700 | 1 | < 0.1% |
fully_equipped_kitchen
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 324.5 KiB |
| 1 | |
|---|---|
| 0 |
| Value | Count | Frequency (%) | |
| 1 | 28961 | 69.7% | |
| 0 | 12572 | 30.3% |
open_fire
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 324.5 KiB |
| 0 | |
|---|---|
| 1 | 2193 |
| Value | Count | Frequency (%) | |
| 0 | 39340 | 94.7% | |
| 1 | 2193 | 5.3% |
terrace
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 324.5 KiB |
| 1 | |
|---|---|
| 0 |
| Value | Count | Frequency (%) | |
| 1 | 25588 | 61.6% | |
| 0 | 15945 | 38.4% |
| Distinct | 183 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.383381889 |
|---|---|
| Minimum | 0 |
| Maximum | 1150 |
| Zeros | 25193 |
| Zeros (%) | 60.7% |
| Memory size | 324.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 12 |
| 95-th percentile | 42 |
| Maximum | 1150 |
| Range | 1150 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 22.41383944 |
|---|---|
| Coefficient of variation (CV) | 2.388673903 |
| Kurtosis | 392.0609731 |
| Mean | 9.383381889 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 12.43972243 |
| Sum | 389720 |
| Variance | 502.3801986 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 25193 | 60.7% | |
| 10 | 1002 | 2.4% | |
| 20 | 959 | 2.3% | |
| 15 | 782 | 1.9% | |
| 12 | 736 | 1.8% | |
| 6 | 706 | 1.7% | |
| 8 | 701 | 1.7% | |
| 30 | 654 | 1.6% | |
| 25 | 583 | 1.4% | |
| 9 | 555 | 1.3% | |
| Other values (173) | 9662 | 23.3% |
| Value | Count | Frequency (%) | |
| 0 | 25193 | 60.7% | |
| 1 | 72 | 0.2% | |
| 2 | 278 | 0.7% | |
| 3 | 353 | 0.8% | |
| 4 | 518 | 1.2% |
| Value | Count | Frequency (%) | |
| 1150 | 1 | < 0.1% | |
| 1020 | 1 | < 0.1% | |
| 761 | 1 | < 0.1% | |
| 708 | 1 | < 0.1% | |
| 600 | 2 | < 0.1% |
garden
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 324.5 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 28250 | 68.0% | |
| 1 | 13283 | 32.0% |
| Distinct | 1155 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 135.9679773 |
|---|---|
| Minimum | 0 |
| Maximum | 312600 |
| Zeros | 33596 |
| Zeros (%) | 80.9% |
| Memory size | 324.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 500 |
| Maximum | 312600 |
| Range | 312600 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1947.680351 |
|---|---|
| Coefficient of variation (CV) | 14.32455193 |
| Kurtosis | 16252.3201 |
| Mean | 135.9679773 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 108.5190241 |
| Sum | 5647158 |
| Variance | 3793458.75 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 33596 | 80.9% | |
| 100 | 243 | 0.6% | |
| 200 | 214 | 0.5% | |
| 50 | 167 | 0.4% | |
| 300 | 165 | 0.4% | |
| 150 | 156 | 0.4% | |
| 60 | 135 | 0.3% | |
| 400 | 134 | 0.3% | |
| 500 | 124 | 0.3% | |
| 30 | 123 | 0.3% | |
| Other values (1145) | 6476 | 15.6% |
| Value | Count | Frequency (%) | |
| 0 | 33596 | 80.9% | |
| 1 | 61 | 0.1% | |
| 2 | 4 | < 0.1% | |
| 3 | 3 | < 0.1% | |
| 4 | 7 | < 0.1% |
| Value | Count | Frequency (%) | |
| 312600 | 1 | < 0.1% | |
| 88800 | 1 | < 0.1% | |
| 85000 | 1 | < 0.1% | |
| 63000 | 1 | < 0.1% | |
| 58000 | 1 | < 0.1% |
| Distinct | 2963 |
|---|---|
| Distinct (%) | 7.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 537.4441047 |
|---|---|
| Minimum | 0 |
| Maximum | 400000 |
| Zeros | 21205 |
| Zeros (%) | 51.1% |
| Memory size | 324.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 406 |
| 95-th percentile | 1811.4 |
| Maximum | 400000 |
| Range | 400000 |
| Interquartile range (IQR) | 406 |
Descriptive statistics
| Standard deviation | 3561.331513 |
|---|---|
| Coefficient of variation (CV) | 6.626422137 |
| Kurtosis | 4785.302185 |
| Mean | 537.4441047 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 53.81642955 |
| Sum | 22321666 |
| Variance | 12683082.14 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 21205 | 51.1% | |
| 150 | 183 | 0.4% | |
| 200 | 170 | 0.4% | |
| 100 | 164 | 0.4% | |
| 250 | 154 | 0.4% | |
| 300 | 151 | 0.4% | |
| 1000 | 146 | 0.4% | |
| 120 | 141 | 0.3% | |
| 400 | 122 | 0.3% | |
| 600 | 118 | 0.3% | |
| Other values (2953) | 18979 | 45.7% |
| Value | Count | Frequency (%) | |
| 0 | 21205 | 51.1% | |
| 1 | 23 | 0.1% | |
| 2 | 2 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 400000 | 1 | < 0.1% | |
| 264781 | 1 | < 0.1% | |
| 120300 | 1 | < 0.1% | |
| 120000 | 2 | < 0.1% | |
| 117800 | 1 | < 0.1% |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.064406616 |
|---|---|
| Minimum | 0 |
| Maximum | 4 |
| Zeros | 10404 |
| Zeros (%) | 25.0% |
| Memory size | 324.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 4 |
| Range | 4 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.40590375 |
|---|---|
| Coefficient of variation (CV) | 0.6810207538 |
| Kurtosis | -1.089578788 |
| Mean | 2.064406616 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.2304527708 |
| Sum | 85741 |
| Variance | 1.976565354 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 2 | 14995 | 36.1% | |
| 0 | 10404 | 25.0% | |
| 4 | 8171 | 19.7% | |
| 3 | 7552 | 18.2% | |
| 1 | 411 | 1.0% |
| Value | Count | Frequency (%) | |
| 0 | 10404 | 25.0% | |
| 1 | 411 | 1.0% | |
| 2 | 14995 | 36.1% | |
| 3 | 7552 | 18.2% | |
| 4 | 8171 | 19.7% |
| Value | Count | Frequency (%) | |
| 4 | 8171 | 19.7% | |
| 3 | 7552 | 18.2% | |
| 2 | 14995 | 36.1% | |
| 1 | 411 | 1.0% | |
| 0 | 10404 | 25.0% |
swimming_pool
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 324.5 KiB |
| 0 | |
|---|---|
| 1 | 704 |
| Value | Count | Frequency (%) | |
| 0 | 40829 | 98.3% | |
| 1 | 704 | 1.7% |
state_of_the_building
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 324.5 KiB |
| as new | |
|---|---|
| good | |
| None | |
| to be done up | |
| to renovate | |
| Other values (2) |
| Value | Count | Frequency (%) | |
| as new | 12312 | 29.6% | |
| good | 11341 | 27.3% | |
| None | 10060 | 24.2% | |
| to be done up | 2916 | 7.0% | |
| to renovate | 2531 | 6.1% | |
| just renovated | 2226 | 5.4% | |
| to restore | 147 | 0.4% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 14 |
|---|---|
| Median length | 4 |
| Mean length | 6.208532974 |
| Min length | 4 |
lattitude
Real number (ℝ≥0)
| Distinct | 1052 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.318006792 |
|---|---|
| Minimum | 2.580669689 |
| Maximum | 6.3009381 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 324.5 KiB |
Quantile statistics
| Minimum | 2.580669689 |
|---|---|
| 5-th percentile | 2.9203275 |
| Q1 | 3.7463234 |
| median | 4.3667216 |
| Q3 | 4.849314652 |
| 95-th percentile | 5.622980506 |
| Maximum | 6.3009381 |
| Range | 3.720268411 |
| Interquartile range (IQR) | 1.102991252 |
Descriptive statistics
| Standard deviation | 0.8096614069 |
|---|---|
| Coefficient of variation (CV) | 0.1875081365 |
| Kurtosis | -0.6412711046 |
| Mean | 4.318006792 |
| Median Absolute Deviation (MAD) | 0.5585481 |
| Skewness | -0.07791596416 |
| Sum | 179339.7761 |
| Variance | 0.6555515938 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 4.3997081 | 929 | 2.2% | |
| 3.323373861 | 757 | 1.8% | |
| 2.9203275 | 705 | 1.7% | |
| 3.7141549 | 652 | 1.6% | |
| 4.3372348 | 519 | 1.2% | |
| 4.351697 | 473 | 1.1% | |
| 3.14048681 | 452 | 1.1% | |
| 5.541864 | 433 | 1.0% | |
| 2.707311916 | 367 | 0.9% | |
| 4.3815707 | 345 | 0.8% | |
| Other values (1042) | 35901 | 86.4% |
| Value | Count | Frequency (%) | |
| 2.580669689 | 238 | 0.6% | |
| 2.6262588 | 7 | < 0.1% | |
| 2.64344877 | 44 | 0.1% | |
| 2.644911715 | 2 | < 0.1% | |
| 2.673321 | 8 | < 0.1% |
| Value | Count | Frequency (%) | |
| 6.3009381 | 2 | < 0.1% | |
| 6.2642498 | 1 | < 0.1% | |
| 6.257827 | 8 | < 0.1% | |
| 6.2053573 | 5 | < 0.1% | |
| 6.1884932 | 3 | < 0.1% |
longitude
Real number (ℝ≥0)
| Distinct | 1052 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.85129752 |
|---|---|
| Minimum | 49.5085018 |
| Maximum | 51.4743516 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 324.5 KiB |
Quantile statistics
| Minimum | 49.5085018 |
|---|---|
| 5-th percentile | 50.3102184 |
| Q1 | 50.666357 |
| median | 50.8695429 |
| Q3 | 51.09779175 |
| 95-th percentile | 51.2996935 |
| Maximum | 51.4743516 |
| Range | 1.9658498 |
| Interquartile range (IQR) | 0.43143475 |
Descriptive statistics
| Standard deviation | 0.3259065571 |
|---|---|
| Coefficient of variation (CV) | 0.006409011627 |
| Kurtosis | 1.455878902 |
| Mean | 50.85129752 |
| Median Absolute Deviation (MAD) | 0.2213379 |
| Skewness | -0.9548716278 |
| Sum | 2112006.94 |
| Variance | 0.106215084 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 51.2211097 | 929 | 2.2% | |
| 51.34942965 | 757 | 1.8% | |
| 51.2303177 | 705 | 1.7% | |
| 51.0397129 | 652 | 1.6% | |
| 50.8018201 | 519 | 1.2% | |
| 50.8465573 | 473 | 1.1% | |
| 51.2996935 | 452 | 1.1% | |
| 50.648205 | 433 | 1.0% | |
| 51.09779175 | 367 | 0.9% | |
| 50.8222854 | 345 | 0.8% | |
| Other values (1042) | 35901 | 86.4% |
| Value | Count | Frequency (%) | |
| 49.5085018 | 11 | < 0.1% | |
| 49.5577562 | 11 | < 0.1% | |
| 49.5580794 | 14 | < 0.1% | |
| 49.5581925 | 2 | < 0.1% | |
| 49.5642065 | 17 | < 0.1% |
| Value | Count | Frequency (%) | |
| 51.4743516 | 34 | 0.1% | |
| 51.4677957 | 10 | < 0.1% | |
| 51.46092495 | 6 | < 0.1% | |
| 51.45063155 | 29 | 0.1% | |
| 51.43155825 | 4 | < 0.1% |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 324.5 KiB |
| Flandre-Occidentale | |
|---|---|
| Anvers | |
| Flandre-Orientale | |
| Hainaut | |
| Liège | |
| Other values (6) |
| Value | Count | Frequency (%) | |
| Flandre-Occidentale | 7322 | 17.6% | |
| Anvers | 5455 | 13.1% | |
| Flandre-Orientale | 5204 | 12.5% | |
| Hainaut | 4287 | 10.3% | |
| Liège | 4133 | 10.0% | |
| Bruxelles-Capitale | 4087 | 9.8% | |
| Brabant flamand | 3845 | 9.3% | |
| Limbourg | 2609 | 6.3% | |
| Brabant wallon | 1796 | 4.3% | |
| Namur | 1620 | 3.9% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 19 |
|---|---|
| Median length | 14 |
| Mean length | 12.2335733 |
| Min length | 5 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 324.5 KiB |
| Flandre | |
|---|---|
| Wallonie | |
| Bruxelles |
| Value | Count | Frequency (%) | |
| Flandre | 24435 | 58.8% | |
| Wallonie | 13011 | 31.3% | |
| Bruxelles | 4087 | 9.8% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.510076325 |
| Min length | 7 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| Unnamed: 0 | postal_code | city_name | type_of_property | subtype_of_property | price | number_of_rooms | house_area | fully_equipped_kitchen | open_fire | terrace | terrace_area | garden | garden_area | surface_of_the_land | number_of_facades | swimming_pool | state_of_the_building | lattitude | longitude | province | region | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 1050 | Ixelles | 0 | house | 340000.0 | 6 | 203 | 1 | 0 | 1 | 0 | 0 | 0 | 95 | 2 | 0 | to be done up | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
| 1 | 1 | 1050 | Ixelles | 0 | mixed use building | 520000.0 | 4 | 200 | 0 | 0 | 0 | 0 | 0 | 0 | 69 | 2 | 0 | to renovate | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
| 2 | 3 | 1050 | Ixelles | 0 | house | 599000.0 | 4 | 160 | 1 | 0 | 1 | 0 | 1 | 55 | 100 | 2 | 0 | to be done up | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
| 3 | 4 | 1050 | Ixelles | 0 | house | 599000.0 | 3 | 160 | 1 | 0 | 1 | 15 | 1 | 60 | 130 | 2 | 0 | good | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
| 4 | 5 | 1050 | Ixelles | 0 | house | 575000.0 | 3 | 171 | 0 | 0 | 0 | 0 | 0 | 0 | 46 | 2 | 0 | just renovated | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
| 5 | 6 | 1050 | Ixelles | 0 | house | 590000.0 | 4 | 225 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 2 | 0 | to renovate | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
| 6 | 7 | 1050 | Ixelles | 0 | house | 575000.0 | 4 | 209 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | None | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
| 7 | 8 | 1050 | Ixelles | 0 | other property | 595000.0 | 1 | 195 | 1 | 1 | 1 | 0 | 1 | 0 | 617 | 4 | 0 | as new | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
| 8 | 9 | 1050 | Ixelles | 0 | house | 595777.0 | 4 | 250 | 0 | 0 | 0 | 0 | 0 | 0 | 70 | 2 | 0 | None | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
| 9 | 11 | 1050 | Ixelles | 0 | house | 650000.0 | 6 | 250 | 1 | 0 | 0 | 0 | 0 | 0 | 60 | 2 | 0 | good | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
Last rows
| Unnamed: 0 | postal_code | city_name | type_of_property | subtype_of_property | price | number_of_rooms | house_area | fully_equipped_kitchen | open_fire | terrace | terrace_area | garden | garden_area | surface_of_the_land | number_of_facades | swimming_pool | state_of_the_building | lattitude | longitude | province | region | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 41523 | 52063 | 4342 | Hognoul | 0 | house | 399000.0 | 4 | 180 | 1 | 1 | 1 | 25 | 1 | 570 | 680 | 3 | 0 | as new | 5.455639 | 50.680810 | Liège | Wallonie |
| 41524 | 52064 | 4342 | Hognoul | 0 | house | 425000.0 | 3 | 315 | 1 | 0 | 1 | 124 | 1 | 250 | 0 | 3 | 0 | None | 5.455639 | 50.680810 | Liège | Wallonie |
| 41525 | 52065 | 7743 | Obigies | 0 | villa | 390000.0 | 4 | 340 | 1 | 1 | 0 | 0 | 1 | 1500 | 2164 | 4 | 0 | None | 3.364281 | 50.662055 | Hainaut | Wallonie |
| 41526 | 52067 | 3050 | Oud-Heverlee | 0 | house | 420000.0 | 5 | 185 | 0 | 0 | 0 | 0 | 1 | 0 | 465 | 0 | 0 | to be done up | 4.667897 | 50.821768 | Brabant flamand | Flandre |
| 41527 | 52068 | 3050 | Oud-Heverlee | 0 | house | 435000.0 | 4 | 234 | 1 | 0 | 1 | 20 | 0 | 0 | 0 | 3 | 0 | as new | 4.667897 | 50.821768 | Brabant flamand | Flandre |
| 41528 | 52070 | 1472 | Vieux-Genappe | 0 | villa | 475000.0 | 5 | 216 | 1 | 1 | 0 | 0 | 0 | 0 | 1550 | 4 | 1 | as new | 4.401503 | 50.629025 | Brabant wallon | Wallonie |
| 41529 | 52071 | 1472 | Vieux-Genappe | 0 | villa | 475000.0 | 5 | 215 | 1 | 0 | 1 | 0 | 0 | 0 | 1550 | 0 | 1 | good | 4.401503 | 50.629025 | Brabant wallon | Wallonie |
| 41530 | 52072 | 1461 | Haut-Ittre | 0 | villa | 499000.0 | 5 | 275 | 1 | 0 | 1 | 0 | 1 | 0 | 1561 | 4 | 0 | None | 4.296472 | 50.648804 | Brabant wallon | Wallonie |
| 41531 | 52073 | 1761 | Borchtlombeek | 0 | villa | 495000.0 | 4 | 235 | 1 | 0 | 0 | 0 | 1 | 0 | 488 | 4 | 0 | None | 4.136915 | 50.848178 | Brabant flamand | Flandre |
| 41532 | 52075 | 3381 | Kapellen | 0 | house | 485000.0 | 3 | 220 | 0 | 0 | 1 | 19 | 0 | 0 | 1019 | 4 | 0 | good | 4.960878 | 50.887345 | Brabant flamand | Flandre |